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the norm of martingales in 2-smooth Banach spaces are improved. Some of 
these improvements hold even for sums of independent real-valued random 
variables. Applications to concentration of measure on product spaces for 
separately Lipschitz functions are presented, including ones concerning the 
central moments of the norm of the sums of independent random vectors 
in any separable Banach space. 

AMS 2010 subject classifications: 60E15, 60B11. 

Keywords and phrases: probability inequalities, bounds on moments, 
Rosenthal inequality, 2-smooth Banach spaces, Hilbert spaces, martingales, 
sums of independent random variables, concentration of measure, sepa- 
rately Lipschitz functions, product spaces. 



1. Introduction, summary, and discussion 

Let {Si)°^Q be a martingale in a separable Banach space (X, || • ||) relative to 
a filter {J^i)^Q of cr-algebras. Assume that 5*0 = and the differences Xi := 
Si — Si-i satisfy the condition 

E.-i\\x,r^b^ (1) 

almost surely for all i G l,oo, where Ei_i denotes the conditional expectation 
given J^i-i and the bi's are some positive real numbers. Here and in what follows, 
for any m and n in Z U {oo} we let m, n := {i G Z : m ^ i ^ n}. Also introduce 



af^:=E||X,|l*, AW := Y^a,{t), and B,, := Y^bj 



for all real i ^ 0, i € l,cx), and n G 0,cxd; as usual, assume that the sum and 
product of an empty family are and 1, respectively; let also O" := 1. 

Assume further that the Banach space (X, || • |j) is 2-smooth (or, more exactly, 
(2, _D)-smooth, for some D = D(X) > O), in the following sense: 

\\x + yr + \\x-yfi:2\\xr + 2D^\\yr (2) 
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for all X and y in X. The importance of the 2-smooth spaces was elucidated 
in [25]: they play the same role with respect to the vector martingales as the 
spaces of type 2 do with respect to the sums of independent random vectors. 
The definition of 2-smooth spaces assumed in this paper is the same as that 
in [17], which is slightly different from the one given in [25] - which latter 
required only that (2) hold for an equivalent norm; the reason for the modified 
definition is that we would like to follow the dependence of certain constants on 
D, the constant of 2-smoothness. Substituting Xx for y in (2), where A € M, one 
observes that necessarily D ^ 1, except for X = {0}. As shown in [5,17], the 
space LP(T, A, ly) is (2, ^/p — 1 )-smooth for any p ^ 2 and any measure space 
(T, u). In particular, what is obvious and well-known, if X is a Hilbert space 
then it is (2,l)-smooth. 



(3) 



Theorem 1. Take any real i ^ and n G 0,oo. Then 

where m :— [|J, J'nj denotes the set of all subsets of the set l,n of cardinality 
j, Hn{J) min J if J ^9, Mn(0) := " + 1, 



^(t) _ t-2j-2 + D^ -p. {t-2k)it-2k-2 + D^)p{t-2k) 



^''= t-2j-l^^ ^^'-'^^ IL 2 

feeoj"-i 



~(t) ._ -Q {t-2j){t-2j-2 + D^)p{t-2j) 



a 



jeo.m-i 

and p{-) and q{-) are any functions such that 

p(2)+g(2)^l, p(2)>0, g(2)>0, 
p{s) ^ 1 and q{s) ^ 1 for all s > 2, (4) 
p(5)i/(3-«) + 5(s)i/(3-«) ^ 1 all s > 3. 

Almost the same result was obtained in [23] in the special case when X is 
a Hilbert space (in which case one can take D = 1) - except that, in place of 
the term (j)-i (3) above, the bound in [23] contained the larger term 

4(2) "(2) 

The proofs, whenever necessary, are deferred to Section 3. 

Corollary 2. In the conditions of Theorem 1, let t > 2 and take any positive 
real numbers Xj for all j d 0,m — 1. Then 

ei|5„1I*<4*)a(*) + c1,*)s*, (5) 
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whe 



Cf:= y cf'-^^^ and 
jeo.m-i 1 ■' 



(6) 



In the special case when X is a Hilbert space, a similar but somewhat less pre- 
cise bound was obtained in [23, Corollary]. Namely, in place of the terms *~ j^^^ , 
' ('^)' bound in [23, Corollary] contains the larger terms 

1, J, and 1, respectively. 

Results somewhat similar to Corollary 2 were also obtained in [17], by rather 
different methods. Particularly, [17, Theorem 4.1] implies the "spectrum" of 
inequalities 

E\\Sn\\' < i^*(c*Al*) +c*/2e*'/"i?*B^) (7) 

for some positive absolute constant K and all real < ^ 2, depending on the freely 
chosen value of the "balancing" parameter c G [1,^]. One can use this freedom 
to minimize or quasi- minimize the bound in (7) in c € [1,^], depending on the 
value of the ratio An'' /B!^ - as is done in [17]; see Theorem 6.1 and the definition 
of B* on page 1693 there. In fact, [17, Theorems 6.1 and 6.2] show that 

(i) for each i > 2 and each value of the ratio An'' /B^^, the minimum in c e [1, t] 
of the upper bound in (7) is an optimal (up to a factor of the form K^) 
upper bound on E [] Sn II ; 

(ii) the spectrum of the bounds in (7) is also "minimal" in the sense that 
for each c € [l,t] there exists a value of the ratio An^ /B^^ such that the 
corresponding "individual" bound in (7) is the best possible (again up to 
a factor of the form if*); 

(iii) the above statements (i) and (ii) hold if the term(s) An '' and/or _B,^ are/is 

t /2 

replaced, respectively, by Emax-^j^ ll-^^ill* and/or E { ^i-i WXiW^) 

One can similarly use the balancing parameters Xj in (6) to minimize or 
quasi-minimize the bound in (5). Moreover, by Remark 6.8 in [17] (with details 
given by [21, Proposition 9.2]), the minimum of the bound on E||S'„||* in [23, 
Theorem 1] with respect to the corresponding balancing parameters is equivalent 
(once again up to a factor of the form K*) to the minimum in c S [l,i] of the 
bound in (7) - at least when the condition (1) holds; hence, the same is true for 
the bounds in (5) and (3), which therefore possess the same up-to-the-_fi'*-factor 
optimality property. 

Thus, the main advantage of inequality (7) over (5) and (3) is that the former 
does not require the condition (1). On the other hand, the bounds in (5) and 
(3) are quite explicit and do not contain unspecified constants such as K in 
(7). If one follows the hnes of the proof of (7) in [17] without serious efforts at 
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modification, the resulting value of K turns out to be large, namely equal 120, 
quite in contrast with the constant factors in (5) and (3) - cf. e.g. the bound in 
(12) below, which is a particular case of (5). Moreover, (5) and (3) appear to 
provide a better dependence of the bounds on the 2-smoothness constant D. 

As for the condition (1), it turns out quite naturally satisfied in applications 
to concentration of measure on product spaces for separately Lipschitz functions 
and, in particular, for the norm of the sums of independent random vectors - 
cf. e.g. [20] and further bibliography there. Such an application is provided in 
Section 2 of the present note. 

Special cases of Corollary 2 are the following inequalities: 

^\\Sn\\' ^^-^^^^{MP + it-l)Bi) foraU te(2,3] and (8) 
E||5„ir^^^^(^ + (t-l)^^^L) foraU ie[3,4] (9) 

and a £ (0, 1). Minimizing the upper bound in (9) in a €E (0, 1), one can combine 
(8) and (9) into the inequality 

E||^nir^ ''/_+^' [(AW)^^"+(^-l)V-'i3,y-f' foraU <e(2,4], (10) 

where St := max(l,i — 2). 

Note that the coefficient of An^ in (8) cannot be less than 1, whatever the 
coefficient of in (8) may be; for instance, take n = 1 and let Xi be such that 
P(Xi = 0) = 1 - 2p and P{Xi = 1) = P{Xi = -1) = p, with p i 0. So, the 
coefficient of aI*-* in (8) takes the optimal value 1 when D — I. 

Consider the now the case when the latter condition holds, that is, when X is 
a Hilbert space. Then the behavior of the upper bound in (10) depends mainly 
on the ratio r := rn^ := J^n If r — 7> oo (which happens e.g. in the situation 

described in the previous paragraph) then the upper bound in (10) is asymptotic 
to J^n and thus is asymptotically optimal. If r J, (which happens e.g. in the 
important case when n — >■ cxd, the XiS are iid, t is fixed, and E < oo) then 
the upper bound in (10) is asymptotic to (t — l)i?Ji. On the other hand, by the 
central limit theorem, this upper bound cannot be less than E |Z|*i3^, where Z 
is a standard normal r.v. Since E |Z|* = t — 1 for t e {2,4}, it follows that the 
upper bound in (10) is asymptotically optimal when r J, and at that either 
t 4, 2 or i — ^ 4. The graph of the ratio of t — 1 to E |Z|* in Figure 1 shows that the 
asymptotic coefficient t — 1 at B\ in (10) for r | is rather close to optimality 
for aU t e (2,4]. 

Again when X is a Hilbert space, (8) and (9) with a = ^ yield 

E||5„||*s^2(*-3)+(^(t) + (i_i)stJ for all ie(2,4], (11) 

where :— max(0,u). This latter result was obtained, by a more Stein-like 
method, in [4, Lemma 6.3] in the case when X = M, i £ (2,3], and the X^'s are 
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ratio 
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0.4 - 
0.2 - 

2.5 3.0 3.5 4.0 ' 

Fig 1 . Graph of the ratio of t - 1 to E\Z\* for t e (2, 4] , 

independent. In the case when X = E, < = 3, and the X^'s are iid, inequahty 
(11) was stated without proof in [16, page 341]; in the more general case when 
X = M., t G (2,4], and the Xi's are independent but not necessarily identically 
distributed, the bound in (11) was obtained in [15, Lemma 13], but with the 
larger factor t(t ~ 1)2^*^^ in place of t — 1. 

In the case t = 3, which is particularly important in applications to Berry- 
Esseen-type bounds (see e.g. [4,22]), (8) or (9) yields 

E||^„||^^i^(4f)+2B^). (12) 

Rosenthal-type inequalities and related results can be found, among other 
papers, in [2,3,6-9,14,18,24,26,27]. 

2. Application to concentration of measure on product spaces for 
separately Lipschitz functions 

In this section, let us re-define Xi, . . . ,X„ to be independent r.v.'s with values 
in measurable spaces Xi, . . . , respectively. Let 5: Cp — > M be a measurable 
function on the product space *p := Xi x • • • x X„. Let us say (cf. [1, 19]) that 
g is separately Lipschitz if it satisfies a Lipschitz-type condition in each of its 
arguments: 

\g{xi, . . .,Xi-i,Xi,Xi+i,. ..,Xn)- g{xi, . . . ,a;„)| < pi{xi,Xi) (13) 

for some measurable functions : Xi x Xi — > M and alH G 1, rt, [xi, . . . ,Xn) G *P, 
and ii € Xi. Take now any separately Lipschitz function g and let 

r :=5(Xi,...,X„). 
Suppose that the r.v. Y has a finite mean. Take any real t > 2. 
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Corollary 3. For each i G take any Xi and in X.;. Then 

E\Y-EY\' ^CtCfY.^p,{X,,x,Y + c'j^'>(^Y.^p,{X,,y,f) , 
1 1 

where c'^^ and c'jp are as in (6) with D — 1, 

Ct :^R{t,bt), 

and bt is the unique maximizer of 

R{t,b) := + (1 - + (1 - 

over all b € [0, |]. 

This follows immediately from Corollary 2 and [20, Corollary 3.1]. 
An example of separately Lipschitz functions g : — >■ K is given by the 
formula 

g{xi,. ..,Xn)^ \\X1 H h Xn\\ 

for all xi,...,Xn in any (not necessarily 2-smooth) separable Banach space 
(X, II • II). In this case, one may take pi{xi,Xi) = \\xi—Xi\\. Thus, one immediately 
obtains 

Corollary 4. Let Xi, . . . , be independent random vectors in a separable 
Banach space (X, || • ||). Let here Y := \\Xi + • • • + Xn\\. For each i € l,n, take 
any Xi and yi in X. Then, with C'^\ Cg\ and C't as in Corollary 3, 

E |y - EFi* ^ Ct c'i^ Y.^\\x,-x,r + 4*' ( J2 E 11^. - y^r) ■ 

1 1 
In particular, C3 < 1.316 and hence, by (12) with D — 1, 

71 n 

E |y - Ey|3 <=: 1.316 ^ E ||X, - ^ 2(^ E ||X, - yj^) ; 
1 1 

this improves the corresponding bound in [20], which had the constant factor 3 
in place of 2. 

The separate-Lipschitz condition (13) is easier to check than a joint-Lipschitz 
one. Also, the former is more generally applicable. On the other hand, when a 
joint-Lipschitz condition is satisfied, one can generally obtain better bounds. 
Literature on the concentration of measure phenomenon, almost all of it for 
joint-Lipschitz settings, is vast; let us mention here only [10-13]. 

3. Proofs 

The proof of Theorem 1 is almost the same as that of [23, Theorem 1]; instead 
of [23, Lemma 1] one should now use the following more general lemma, valid 
for general 2-smooth Banach spaces. 
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Lemma 1. Suppose thatt ^ 2 andp{t) and q{t) satisfy conditions (4). Suppose 
also that the function Q{-) := \\ ■ |p is twice differentiable everywhere on X - 
which may be assumed without loss of generality in view of [17, Remark 2.4]. 
Then for any x and y in X 

\\x+y\\^^M'^{\\x\r^Q\x){y) + '-^i^^ 

Proof of Lemma 1. Take indeed any x and y in X and introduce 

f{0) -.= 11x0^-11x11* ^Q{xe)~\\x\\', 

where 9 e [0,1] and xg := x + Oy, so that /(O) = 0, /'(O) = ^\\x\\*'^Q'{x){y), 
and /(I) = ||a; + y\\* — \\x\\*. Moreover, since X is (2, ZJj-smooth and Q is twice 
differentiable on X, for aU 9 one has Q" {xe){y,y) ^ 2D^||j/||^ and \Q'{xe){y)\ < 
2||xe|||l2/|l (cf. [17, Lemma 2.2 and Remark 2.4]); so, 

f"{9) = 1(1-1) Q{xey'^-\Q'{xe){y)f + \ Q{xer'^-^Q" {xe){y,y) 
^t{t-2 + D^)\\xe\r^y\\^ 
^t{t-2 + D^){p{t)\\x\r^yr + q{t)9'-^yr), 

since {a + /3)*^^ ^ p{t)a*'^^ + q{t)P*-^'^ for all a and /3 in [0, oo). It remains to 
use the identities 



\\x + y\\'- \\xr = /(I) = /(O) + /'(O) + /„\l - 9)f"{9)d9. 



□ 



Proof of Corollary 2. For any j e 0, to — 1 and \j > 



^ (4*))^ (S* ) A ^ ^AT^^- AW + ^ A-^^-^ ; (14) 

the second inequality here follows by the log-convexity of An"* in s, and the 
fourth one follows by Young's inequality x^l'^y^l'^ ^ f ^ f '^'^^ ^' ^' ^ ^'^'^ 
such that a; ^ 0, y > 0, p > 0, (7 > 0, and ^ + | = 1- Next, 

(15) 

the iterated sum here is bounded by induction, using the inequality 

iei.fe 
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for s ^ and k G l,oo; in turn, inequality (16) follows because its left-hand 
side is a left (and hence lower) Riemann sum for the integral a;*' dx of the 
nondecreasing function whereas the right-hand side of (16) is the value of 
the integral. Now Corollary 2 follows by (3), (14), and (15). □ 
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